AITopics | Fountain County

Collaborating Authors

Fountain County

Compositional Generalization for Data-to-Text Generation

Xu, Xinnuo, Titov, Ivan, Lapata, Mirella

arXiv.org Artificial IntelligenceDec-5-2023

Data-to-text generation involves transforming structured data, often represented as predicate-argument tuples, into coherent textual descriptions. Despite recent advances, systems still struggle when confronted with unseen combinations of predicates, producing unfaithful descriptions (e.g. hallucinations or omissions). We refer to this issue as compositional generalisation, and it encouraged us to create a benchmark for assessing the performance of different approaches on this specific problem. Furthermore, we propose a novel model that addresses compositional generalization by clustering predicates into groups. Our model generates text in a sentence-by-sentence manner, relying on one cluster of predicates at a time. This approach significantly outperforms T5~baselines across all evaluation metrics.Notably, it achieved a 31% improvement over T5 in terms of a metric focused on maintaining faithfulness to the input.

computational linguistic, predicate, tuple, (14 more...)

arXiv.org Artificial Intelligence

2312.02748

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Middle East > Republic of Türkiye > İzmir Province > İzmir (0.05)
(23 more...)

Genre: Research Report > New Finding (0.67)

Industry: Government > Regional Government > North America Government > United States Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

Suzgun, Mirac, Melas-Kyriazi, Luke, Jurafsky, Dan

arXiv.org Artificial IntelligenceNov-14-2022

In open-ended natural-language generation, existing text decoding methods typically struggle to produce text which is both diverse and high-quality. Greedy and beam search are known to suffer from text degeneration and linguistic diversity issues, while temperature, top-k, and nucleus sampling often yield diverse but low-quality outputs. In this work, we present crowd sampling, a family of decoding methods based on Bayesian risk minimization, to address this diversity-quality trade-off. Inspired by the principle of "the wisdom of the crowd," crowd sampling seeks to select a candidate from a pool of candidates that has the least expected risk (i.e., highest expected reward) under a generative model according to a given utility function. Crowd sampling can be seen as a generalization of numerous existing methods, including majority voting, and in practice, it can be used as a drop-in replacement for existing sampling methods. Extensive experiments show that crowd sampling delivers improvements of 3-7 ROUGE and BLEU points across a wide range of tasks, including summarization, data-to-text, translation, and textual style transfer, while achieving new state-of-the-art results on WebNLG and WMT'16.

computational linguistic, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2211.07634

Country:

North America > United States > Wisconsin > Outagamie County > Appleton (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Nepal > Bagmati Province > Kathmandu District > Kathmandu (0.04)
(35 more...)

Genre:

Research Report > New Finding (0.68)
Personal > Obituary (0.46)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Consumer Products & Services (0.68)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
(2 more...)

Add feedback